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In atmospheric dispersion models of nuclear accident, the dispersion coefficients were usually obtained by 
tracer experiment, which are constant in different atmospheric stability classifications. In fact, the atmospheric 
wind field is complex and unstable. The dispersion coefficients change even in the same atmospheric stability, 
hence the great errors brought in. According to the regulation, the air concentration of nuclides around nuclear 
power plant should be monitored during an accident. The monitoring data can be used to correct dispersion 
coefficients dynamically. The error can be minimized by correcting the coefficients. This reverse problem is 
nonlinear and sensitive to initial value. The property of searching the optimal solution of Genetic Algorithm 
(GA) is suitable for complex high-dimensional situation. In this paper, coupling with Lagrange dispersion 
model, GA is used to estimate the coefficients. The simulation results show that GA scheme performs well 
when the error is big. When the correcting process is used in the experiment data, the GA-estimated results 
are numerical instable. The success rate of estimation is 5% lower than the one without correction. Taking 
into account the continuity of the dispersion coefficient, Savitzky-Golay filter is used to smooth the estimated 
parameters. The success rate of estimation increases to 75.86%. This method can improve the accuracy of 


atmospheric dispersion simulation. 
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I. INTRODUCTION 


Traditionally, coefficients of radionuclide dispersion were 
preset for nuclear accident. They are obtained from field 
campaign experiments, with the differences between mete- 
orological conditions in experiments and real-time meteoro- 
logical conditions being a major source of errors. In most 
dispersion models, Pasquill-Gifford (PG) curve established 
on Prairie-Grass-Field experiment [1] and similar dispersion 
coefficients system are used to represent the lateral and verti- 
cal plume spread rates. 

Generally these dispersion coefficient systems are used in 
conservative forecast and recommended by regulation. But 
when the coefficients are validated by tracer experiments in 
nuclear power plants (NPPs), the result showed that the dis- 
persion coefficients are much different with the ones fitted by 
the experimental data [2]. In an NPP, air concentration of ra- 
dionuclides should be monitored during an accident [3]. For 
more accurate dispersion coefficients, a viable way is to cor- 
rect the empirical ones with monitored data dynamically. The 
dispersion coefficients will be corrected only based on moni- 
tored data and will not be calculated by atmospheric stability 
and other information. 

Haupt proposed to coupled the genetic algorithm (GA) 
with Gaussian plume model and receptor model, so as to 
achieve a better estimation of pollutant’s dispersion [4]. 
Jeong used a similar technique to analyze the atmospheric 
dispersion of nuclides from an NPP [5, 6]. Allen used the 
assimilation estimation method to estimate wind-field infor- 
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mation in the dispersion model [7]. Most numerical simula- 
tions used ideal models. However, the correlation coefficient 
value between predicted values and observed values is less 
than 0.5, and value of observation has a significant noise [8]. 
Thus in the actual situation, this error should not be ignored. 
The above studies indicate that the source term estimation and 
dispersion parameter dynamic correcting are effective meth- 
ods to improve the accuracy of dispersion models. Nonethe- 
less, the impact of dispersion coefficient error on the result 
requires further study. 


To find out the performances with diverse errors of dis- 
persion coefficient, a scheme was established based on GA 
method and was validated by numerical simulation and Kin- 
caid experiment data [8]. 


Il. METHOD 


A. Lagrangian puff model 


Parameters estimation requires that the dispersion model 
must be suitable for complex meteorological conditions, and 
high computing efficiency. Compared to Gaussian plume 
model and particle model, Lagrangian puff model meets the 
requirements better [9]. Therefore, it was used as a forward 
model to calculate the pollutant distributions. 


The basic assumption of Lagrangian puff model is that se- 
quentially releasing of pollutant is treated as a series of Gaus- 
sian shaped puffs at a fixed rate. The concentration of pol- 
lutants in each puff fits Gaussian distribution. The total con- 
centration at some location is the sum of all puffs’ value at 
this point. The concentration distribution of a puff can be 
described as Eq. (1) 
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where Q is total concentration of the puff; (£e, Yc, Zc) are 
coordinates of the puff center; zjny is the height of inversion 
temperature layer; (x, y, z) are coordinates of the observation 
point; and (oz, Cy, Cz) are dispersion parameters of the puff. 

There are many empirical formulas to compute the disper- 
sion parameters according to the atmospheric stability and 
downwind distance, among which the PG curve, as described 
in Eq. (2), is most often used. 


Öz Soe 
Oy = Py: a, (2) 


Or = p x". 


The oz, Jy and o, are computed according to downwind 
distance x, and the PG coefficients py, qy and p, and qz, are 
determined by the wind-field and atmospheric stability. 


B. PG coefficients correction 


PG curve is derived from Prairie Grass field experiments. 
If the experimental conditions differ from the actual meteoro- 
logical conditions, errors will be introduced to the model. To 
minimize effects of the PG error, one way is using the origi- 
nal PG coefficients as initial values, and correcting them by 
the observed values. This process can be regarded as an op- 
timization by minimizing the difference between calculated 
values and observed values. 


C. Genetic algorithm 


The reverse model of atmospheric dispersion is strongly 
nonlinear and sensitive to initial values. Tan used the least 
square method to estimate the dispersion coefficients but 
most of the results are local optimal solutions [10]. This 
means that an effective global optimization algorithm is re- 
quired, and GA can just satisfy this requirement. Invented 
by J. Holland in 1975, GA is an efficient and parallel global 
search method. 

Fitness function (ft) is an essential issue in GA. It deter- 
mines differences between the observed and predicted value. 
The smaller the ft is, the closer between observations and 
predictions. On the other hand, an observation’s weight is 
different from each other due to the diversity of observing 
conditions. An appropriate fitness function should comply 
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with two principles: 1) for observation station 7, the fitness 
function value should increased with the differences between 
the observed value O; and calculated value C;; 2) observa- 
tion stations should be given different weight in the fitness 
function due to their data convince. 

Ji et al. compared four types of fitness function, and found 
that concentration-related weight of observations is better 
than equal weight [11]. The fitness function can be con- 
structed as Eq. (3) on basis of the least square method 


f=] amum 3) 
>D O; 
4=1 


where, N is the number of observation stations; C; is the pre- 
dicted value in Station i; and O; is the observation value in 
Station 7. This fitness function is a modification for equal 
weight of every station. The weight is related to the concen- 
tration value of observations. 


I. NUMERICAL SIMULATIONS 


The numerical stability of this estimation scheme was val- 
idated by numerical simulations. White noise (a sequence of 
serially uncorrelated random variables with zero mean and 
finite variance) was added to the observations which were 
calculated by the model. Based on modified observations, 
GA was used to estimate the PG coefficients. Performance 
of the scheme was evaluated by comparing the ‘true’ PG 
coefficients with the estimated ones. 


A. Setup of dispersion parameters 


To simplify the calculation, in a Cartesian coordinate sys- 
tem, the release point was set to (0, 0), height is 187 m, release 
rate was one unit, and tracer gas was SF,. Wind was stable 
with one direction. Atmospheric stability was D class. The 
original PG coefficients, PG'™*, were set as Eq. (4). The 
changing of PG coefficients were estimated based on the ob- 
servation value in each simulation step. 


pm = 1.503, gi = 0.833, 


4 
pee = 0.151, gi = 1.219. 9 


B. Configuration of observations 


Signal-to-noise ratio (SNR) is defined as the ratio of signal 
power to the noise power: 


SNR = Osignal / noise . (5) 


Different SNR levels of 50, 25, 5, 2.5 and 0.25 were added 
to every sampling, including positive infinity. The noise is 
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white noise, which is a random signal with a constant power 
spectral density. According to the modified observations, the 
PG‘ was estimated to analyze the impact of the error. 

Initial values of parameters were set as Eq. (6), with every 
parameter added some noise: 


py = 1.533, gq" = 0.712, 


re re (6) 
pe? = 0.255, q? = 1.401. 
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GA with the fitness function corrected these PGP to 
achieve PG‘°". Less differences between PG" and PG™° 
means better performance of the fitness function. 


C. Result of numerical experiment 


The error between PG" and PG""* is expressed by Eq. (7) 


Tpq = T (logs py" — logs pyr)? + (ap™ — gy")? + (logs pe" — loga pe)? + (ag" — q3")? (7) 


As shown in Fig. 1, the error of estimation parameters in- 
creased with 1/SNR. The variance of white noise is 1/ SNR, 
and at 1/SNR = 1, dpq < 1.5, which means the PG°°" is 
very close to PG™"®. It indicates that this scheme has good 
robustness. 


3.5 


3.0 


1/SNR 


Fig. 1. Correction result with diverse errors. 


IV. EXPERIMENT VALIDATION 


The simulation shows that parameters of the dispersion 
model can be corrected on-line by GA. But in a tracer ex- 
periment, conditions differ from the simulation. Being not 
the ideal white noise, the errors distribute disorderly, and are 
larger and time-dependent. So, Kincaid tracer experiment 
data were used to validate the effectiveness of GA. 


A. Kincaid tracer experiment 


The Kincaid field experiment was performed as part of the 
EPRI Plume Model Validation and Development Project. A 
very comprehensive experimental campaign was conducted 
in 1980 and 1981. The Kincaid NPP is situated in Illinois, 
USA (39.59°N, 89.49°W). The terrain is at an elevation of 
~180 m, thus in the model the terrain effect is not considered. 


The NPP has a 187 m stack of P9 m. In the experiment, SF, 
was released from the stack. The tracer releases started sev- 
eral hours before the sampling. There were ~350h of tracer 
experiments in the experimental campaign. 

The 1284 arc-max concentration data, of “high quality” 
and classified artificial, are usually used for model validation 
and development. However, for this estimation, data should 
not be screened. In a real accident, there is not enough in- 
formation to judge the effectiveness of data. All the data was 
used, which introduced uncertainty to the estimation. 


B. Validation method 


There were 20 cases of experiment, each lasted 3h to 9h. 
Over 100 observation stations recorded the concentration of 
SFe in the air during the release period. The statistical val- 
ues, FA2, FAS, FB, NMSE, CORR and BIAS, as described as 
below, are used to measure the differences between predicted 
and observed values. 


FA2 = fraction of data satisfying 0.5 < Co/Cp < 2, 
FA5 = fraction of data satisfying 0.2 < C,/C, < 5, 


FB = (C, — C,)/[0.5(C, + ©), 
i (8) 


BIAS = C.-C), 


where, p denotes model prediction; o denotes observation; 
over bar denotes averaged dataset and ø is the mean square 
error over the dataset. 


V. RESULTS AND DISCUSSION 
A. Results of GA estimation 


The results are shown in Table 1. In the ‘Original data’ row, 
the correlation coefficient (CORR) is 0.20, indicating a weak 
positive relationship between the data of prediction and obser- 
vation; the FA2(0.11) and FA5(0.26) mean that the differences 
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Fig. 2. Smoothing filter of estimated PG coefficients (a) py, (b) pz (c) qy and (d) q- (O: original data; A: smoothed data). 


TABLE 1. Influence of PG correction on forecast ability of disper- 
sion model 


NMSE BIAS CORR FA2 FA5 FB Success 

rate (%) 
Original data 24.61 —0.87 0.20 0.11 0.26 —1.05 75.86 
GA 22.97 —0.13 —0.03 0.18 0.37 0.43 69.54 
GA Prediction 25.57 —0.07 —0.02 0.15 0.34 0.47 70.11 
GA smooth 31.19 —0.02 —0.01 0.15 0.32 0.65 75.86 


are big. In view of this, estimating PG coefficients is neces- 
sary to improve the model quality. Using GA method (the 
‘GA’ row) increases the FA value and decreases the BJAS and 
FB values. Therefore, after the dynamic correction, the model 
is more accurate and efficient with the large errors of disper- 
sion coefficients. 


B. GA Prediction 


The prediction process uses the parameters that are fitted 
based on the historical data. For every case, the steps are as 
follows: 


1) With the standard PG parameters as prior value, correct 
the parameters using GA in every step. 

2) Calculate the prediction value based on the last step of 
the corrected parameters. 

3) Calculate the prediction value based on the standard PG 
parameters. 

4) Compare the prediction observations between Steps 2 
and 3, it will tell the effect of parameters predicted by GA. 

The results are given in Table |. Most of the data worsened 
because the calculation of dispersion parameters in one step 
was not related to the next step. The result obtained by GA 
is only numerically best, not physically best. The continuing 
property of the PG coefficients was ignored. 


C. Smoothing filter 


Though the statistics values are much better than the ones 
in original model, the GA-estimated results are numerical in- 
stable. Some estimated values were greatly deviated from the 
true value. The success rate of estimations was 5% lower 
than the one without correction. To reduce impact of the 
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Fig. 3. Statistics values of (a) BIAS, (b) CORR, (c) FA2, (d) FAS, (e) FB and (f) NMSE, of case 50780 for estimation schemes of GA (e), GA 


Prediction (A) and GA Smooth (¥), comparing with the original data ( 


fluctuations in GA estimation, it assumed that the dispersion 
parameters were of continuous changes, as shown in Eq. (9) 


Pylt +1] = Py [t] +7, 
dy [t +1] = qy lt] TN, 
pelt + 1] = pelé] +n, 
qz [t T 1] = qz [t] T, 


). 


where 77 is white noise representing time-dependent changes 
of dispersion coefficients; t is the current time step and t + 1 
is the next time step. 

Thus, before their uses for prediction, the GA-estimated 
parameters were filtered smoothly with Savitzky-Golay filter. 
The main advantage of this method is that it tends to pre- 
serve features of the distribution such as relative maxima, 
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TABLE 2. The correlation coefficient between FAS and other statistics 


Step Station No. Op/Co Oplo NMSE BIAS CORR FA2 FA5 FB 
GA 0.025 0.20° 0.32° 0.16" -0.17° —0.49° 0.22° 0.35° 0.32° —0.43° 
GA prediction 0.16" 0.093 0.23° 0.23° —0.19" —0.33° 0.12 0.17" 0.26° —0.31° 
GA smooth 0.13 0.13 0.25° 0.32° —0.13 —0.47° 0.077 0.13 0.235 —0.43° 


aà Correlation is significant at the 0.05 level. 
> Correlation is significant at the 0.01 level. 


minima and width, which are usually ‘flattened’ by other ad- 
jacent averaging techniques. The information in the estimated 
dispersion coefficients will be reserved after Savitzky-Golay 
smoothing. 


Take 50780 case as example, the results after smoothing 
filter are shown in Fig. 2. The PG coefficients are more 
stable than before. It means that the distribution of nuclides 
changes smoothly step by step. Figure 3 shows statistics val- 
ues of the results for the three schemes. ‘GA smooth’ scheme 
effectively improves the results, and inherited excellent char- 
acteristics of ‘GA prediction’ scheme. It balanced the stabil- 
ity and efficiency, performed best of the three schemes. 


The statistical values of all the cases are shown in ‘GA 
Smooth’ row in Table 1. The NMSE, FA2, FAS and FB data 
are slightly worse than ‘GA’ and ‘GA Prediction’, but are still 
much better than those in the “Original data’. The success 
rate is equal to that of ‘GA’, indicating that it removed the nu- 
merical instability. The “GA Smooth’ scheme estimates the 
dispersion coefficients efficiently. 


D. Correlation test 


To analyze which factor determined the accuracy of esti- 
mations, correlation tests were performed between value of 
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